Search Efficiency in Indexing Structures for Similarity Searching
نویسندگان
چکیده
Similarity searching finds application in a wide variety of domains including multilingual databases, computational biology, pattern recognition and text retrieval. Similarity is measured in terms of a distance function (edit distance) in general metric spaces, which is expensive to compute. Indexing techniques can be used reduce the number of distance computations. We present an analysis of various existing similarity indexing structures for the same. The performance obtained using the index structures studied was found to be unsatisfactory . We propose an indexing technique that combines the features of clustering with M tree(MTB) and the results indicate that this gives better performance .
منابع مشابه
یک روش مبتنی بر خوشهبندی سلسلهمراتبی تقسیمکننده جهت شاخصگذاری اطلاعات تصویری
It is conventional to use multi-dimensional indexing structures to accelerate search operations in content-based image retrieval systems. Many efforts have been done in order to develop multi-dimensional indexing structures so far. In most practical applications of image retrieval, high-dimensional feature vectors are required, but current multi-dimensional indexing structures lose their effici...
متن کاملOrChem: an open source chemistry search engine for Oracle
BACKGROUND Registration, indexing and searching of chemical structures in relational databases is one of the core areas of cheminformatics. However, little detail has been published on the inner workings of search engines and their development has been mostly closed-source. We decided to develop an open source chemistry extension for Oracle, the de facto database platform in the commercial worl...
متن کاملNew Approaches to Similarity Searching in Metric Spaces
Title of dissertation: NEW APPROACHES TO SIMILARITY SEARCHING IN METRIC SPACES Cengiz Celik, Doctor of Philosophy, 2006 Dissertation directed by: Professor David Mount Department of Computer Science The complex and unstructured nature of many types of data, such as multimedia objects, text documents, protein sequences, requires the use of similarity search techniques for retrieval of informatio...
متن کاملA Uniied Model for Similarity Searching ?
The indexing algorithms and data structures for similarity searching in metric spaces seem to emerge from a great diversity, and diierent approaches have been proposed and analyzed separately, often under diierent assumptions. Currently, the only realistic way to compare two diierent algorithms is to apply them to the same data set. We present a uniied model for studying similarity searching al...
متن کاملModel for Similarity Searching ?
The indexing algorithms and data structures for similarity searching in metric spaces seem to emerge from a great diversity, and diierent approaches have been proposed and analyzed separately, often under diierent assumptions. Currently, the only realistic way to compare two diierent algorithms is to apply them to the same data set. We present a uniied model for studying similarity searching al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره cs.DB/0403014 شماره
صفحات -
تاریخ انتشار 2004